
    A Formalization of The Natural Gradient Method for General Similarity Measures

    In optimization, the natural gradient method is well known for likelihood maximization. The method uses the Kullback-Leibler divergence, corresponding infinitesimally to the Fisher-Rao metric, which is pulled back to the parameter space of a family of probability distributions. This way, gradients with respect to the parameters respect the Fisher-Rao geometry of the space of distributions, which might differ vastly from the standard Euclidean geometry of the parameter space, often leading to faster convergence. However, when minimizing an arbitrary similarity measure between distributions, it is generally unclear which metric to use. We provide a general framework that, given a similarity measure, derives a metric for the natural gradient. We then discuss connections between the natural gradient method and multiple other optimization techniques in the literature. Finally, we provide computations of the formal natural gradient to show overlap with well-known cases and to compute natural gradients in novel frameworks.
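
    As a rough illustration of the update this work generalizes, the sketch below applies one natural gradient step in the familiar likelihood-maximization case, preconditioning the ordinary gradient with an empirical estimate of the Fisher information matrix. The function name, the empirical Fisher estimate, and the damping term are illustrative assumptions, not the paper's general construction for arbitrary similarity measures.

```python
import numpy as np

def natural_gradient_step(theta, grad, per_example_grads, lr=0.1, damping=1e-4):
    """One natural gradient update: precondition the loss gradient with the
    inverse of an (empirical) Fisher information matrix.

    theta             : current parameters, shape (d,)
    grad              : gradient of the loss at theta, shape (d,)
    per_example_grads : per-sample score vectors, shape (n, d), used to
                        estimate F = E[g g^T]
    """
    n, d = per_example_grads.shape
    fisher = per_example_grads.T @ per_example_grads / n
    fisher += damping * np.eye(d)              # damping keeps F well-conditioned
    nat_grad = np.linalg.solve(fisher, grad)   # F^{-1} grad
    return theta - lr * nat_grad
```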

    Customised ensemble methodologies for deep learning: boosted residual networks and related approaches

    This paper introduces a family of new customised methodologies for ensembles, called Boosted Residual Networks (BRN), which builds a boosted ensemble of Residual Networks by growing the member network at each round of boosting. The proposed approach combines recent developments in Residual Networks (a method for creating very deep networks by including a shortcut layer between different groups of layers) with Deep Incremental Boosting, a methodology to train fast ensembles of networks of increasing depth through the use of boosting. Additionally, we explore a simpler variant of Boosted Residual Networks based on Bagging, called Bagged Residual Networks (BaRN). We then analyse how recent developments in ensemble distillation can improve our results. We demonstrate that the synergy of Residual Networks and Deep Incremental Boosting has better potential than simply boosting a Residual Network of fixed structure or using the equivalent Deep Incremental Boosting without the shortcut layers, by permitting the creation of models with better generalisation in significantly less time.
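
    The sketch below shows the general shape of such a scheme, assuming a SAMME-style boosting loop in which the member network inherits the previous round's weights and is deepened by one residual block per round. The growable-network interface, the weighting rule, and all hyperparameters are assumptions for illustration, not the paper's exact procedure.

```python
import copy
import torch
import torch.nn as nn

class GrowableResNet(nn.Module):
    """A small residual network whose depth can be increased between
    boosting rounds by appending another residual block."""
    def __init__(self, dim, n_classes):
        super().__init__()
        self.blocks = nn.ModuleList()
        self.head = nn.Linear(dim, n_classes)
        self.dim = dim

    def grow(self):
        self.blocks.append(nn.Sequential(nn.Linear(self.dim, self.dim), nn.ReLU()))

    def forward(self, x):
        for block in self.blocks:
            x = x + block(x)                  # shortcut connection
        return self.head(x)

def boosted_residual_networks(X, y, n_classes, rounds=3, epochs=50):
    """SAMME-style boosted ensemble of growing residual networks.

    X : float tensor of shape (n, d); y : long tensor of shape (n,).
    Returns a list of (vote weight, trained member network) pairs.
    """
    n = len(X)
    w = torch.full((n,), 1.0 / n)             # boosting weights over samples
    net = GrowableResNet(X.shape[1], n_classes)
    ensemble = []
    for _ in range(rounds):
        net = copy.deepcopy(net)              # inherit previous round's weights
        net.grow()                            # deepen the member network
        opt = torch.optim.Adam(net.parameters(), lr=1e-2)
        for _ in range(epochs):               # training weighted by w
            opt.zero_grad()
            losses = nn.functional.cross_entropy(net(X), y, reduction="none")
            (w * losses).sum().backward()
            opt.step()
        pred = net(X).argmax(dim=1)
        err = (w * (pred != y)).sum().clamp(1e-10, 1 - 1e-10)
        alpha = torch.log((1 - err) / err) + torch.log(torch.tensor(n_classes - 1.0))
        w = w * torch.exp(alpha * (pred != y))   # upweight misclassified samples
        w = w / w.sum()
        ensemble.append((alpha.item(), net))
    return ensemble
```

    Carrying over the previous round's weights when the network is grown is what allows later, deeper members to be trained quickly, which is the point of combining residual shortcuts with incremental boosting.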

    Fluctuation-Driven Neural Dynamics Reproduce Drosophila Locomotor Patterns.

    The neural mechanisms determining the timing of even simple actions, such as when to walk or rest, are largely mysterious. One intriguing, but untested, hypothesis posits a role for ongoing activity fluctuations in neurons of central action selection circuits that drive animal behavior from moment to moment. To examine how fluctuating activity can contribute to action timing, we paired high-resolution measurements of freely walking Drosophila melanogaster with data-driven neural network modeling and dynamical systems analysis. We generated fluctuation-driven network models whose outputs (locomotor bouts) matched those measured from sensory-deprived Drosophila. From these models, we identified those that could also reproduce a second, unrelated dataset: the complex time-course of odor-evoked walking for genetically diverse Drosophila strains. Dynamical models that best reproduced both Drosophila basal and odor-evoked locomotor patterns exhibited specific characteristics. First, ongoing fluctuations were required. In a stochastic resonance-like manner, these fluctuations allowed neural activity to escape stable equilibria and to exceed a threshold for locomotion. Second, odor-induced shifts of equilibria in these models caused a depression in locomotor frequency following olfactory stimulation. Our models predict that activity fluctuations in action selection circuits cause behavioral output to more closely match sensory drive and may therefore enhance navigation in complex sensory environments. Together, these data reveal how simple neural dynamics, when coupled with activity fluctuations, can give rise to complex patterns of animal behavior.
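
    A minimal caricature of the mechanism described above: a single noise-driven rate unit relaxes toward a stable equilibrium below a locomotion threshold, and fluctuations occasionally carry it across that threshold, producing bout-like episodes. The equation, the threshold, and all parameter values are illustrative assumptions, not the paper's fitted network models.

```python
import numpy as np

def simulate_bouts(T=200.0, dt=0.01, tau=1.0, noise=0.35,
                   threshold=1.0, drive=0.0, seed=0):
    """Euler-Maruyama simulation of a single noise-driven rate unit:

        dx/dt = (-x + drive) / tau + noise * xi(t)

    Without noise, x settles at the stable equilibrium x* = drive, below the
    locomotion threshold; fluctuations occasionally push x across the
    threshold, producing bout-like episodes of 'walking'.
    """
    rng = np.random.default_rng(seed)
    n = int(T / dt)
    x = np.zeros(n)
    for t in range(1, n):
        dx = (-x[t - 1] + drive) / tau * dt + noise * np.sqrt(dt) * rng.standard_normal()
        x[t] = x[t - 1] + dx
    walking = x > threshold                 # boolean locomotor state per time step
    return x, walking

# Shifting the equilibrium (e.g. a transient 'odor' drive) changes how often
# activity crosses the threshold, and hence the frequency of locomotor bouts.
activity, walking = simulate_bouts(drive=0.3)
print("fraction of time walking:", walking.mean())
```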

    Online learning with adaptive local step sizes

    Almeida et al. have recently proposed online algorithms for local step size adaptation in nonlinear systems trained by gradient descent. Here we develop an alternative to their approach by extending Sutton’s work on linear systems to the general, nonlinear case. The resulting algorithms are computationally little more expensive than other acceleration techniques, do not assume statistical independence between successive training patterns, and do not require an arbitrary smoothing parameter. In our benchmark experiments, they consistently outperform other acceleration methods as well as stochastic gradient descent with fixed learning rate and momentum.
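
    For intuition, the sketch below implements a generic per-parameter step-size adaptation rule in the spirit of such methods: each local step size is multiplicatively increased when successive gradient components agree in sign and decreased when they disagree. The rule, the meta learning rate, and the toy objective are illustrative stand-ins, not the algorithm derived by the authors.

```python
import numpy as np

def adaptive_local_sgd(grad_fn, theta, steps=1000, alpha0=0.01, meta_lr=0.05):
    """Gradient descent with one adaptive step size per parameter.

    Each local step size alpha_i is multiplicatively increased when the
    current and previous gradient components agree in sign (the step was
    too small) and decreased when they disagree (the step overshot).
    """
    alpha = np.full_like(theta, alpha0)
    prev_grad = np.zeros_like(theta)
    for _ in range(steps):
        grad = grad_fn(theta)
        alpha *= np.exp(meta_lr * np.sign(grad * prev_grad))  # meta update
        theta = theta - alpha * grad
        prev_grad = grad
    return theta

# Toy usage: minimize 0.5 * theta^T H theta - b^T theta with ill-scaled curvature.
H = np.diag([9.0, 0.25])
b = np.array([1.0, -2.0])
theta = adaptive_local_sgd(lambda t: H @ t - b, np.zeros(2))
print(theta)   # approaches the minimizer H^{-1} b = [1/9, -8]
```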